Speech recognition, sylabification and statistical phonetics
نویسنده
چکیده
The classical approach in phonetics of careful observation of individual utterances can, this paper contends, be usefully augmented with automatic statistical analyses of large amounts of speech. Such analyses, using methods derived from speech recognition, are shown to quantify several known phonetic phenomena, most of which require syllable structure to be taken into account, and reveal some apparently new phenomena. Practical speech recognition normally ignores syllable structure. This paper presents quantitative evidence that prevocalic and postvocalic consonants behave differently. It points out some ways in which current speech recognition can be improved by taking syllable boundaries into account.
منابع مشابه
Dependence and independence in automatic speech recognition and synthesis
When automatically recognising or synthesising speech by computer, we are forced to make a number of assumptions of statistical independence in order to make certain problems tractable. This paper gives a few examples of how phonetic knowledge is already usefully informing these decisions about independence, and a few examples of where it isn’t, yet. Temporal integration – how information from ...
متن کاملSpeech Recognition Based on Syllable and Pseudo-articulatory Features
The prevailing approach to speech recognition is the statistical technique known as hidden Markov modeling (HMM), which is capable of reasonable performance in general usage (~95%) – but not much more. The major drawback is that it ignores phonetics, which has the potential for going beyond the acoustic variations to provide a more abstract underlying representation. Also, HMM only produces a s...
متن کاملAutomatic Phonetic Transcription of Non − Prompted Speech
Automatic Segmentation" (MAUS) system labels and segments the phonetic constituents of spoken German in a manner similar to highly trained phoneticians. MAUS has been used to train automatic speech recognition (ASR) systems as well as to provide detailed statistical analyses of spontaneous speech (using the Verbmobil I and RVG I corpora). The MAUS system is a reliable, automatic means of testin...
متن کاملPhonetics and Speech Technology
Is there a need to apply phonetics in speech technology development? How can phonetic thinking influence the quality of the final product (synthesised speech, speech recognition)? What happens if phonetic aspects are not used? What branches of phonetics are used, and what is to be used in speech technology? How phonetic thinking can be embedded into the development procedure of a speech technol...
متن کاملSpeech Recognition using Acoustic Landmarks and Binary Phonetic Feature Classifiers
In spite of decades of research, Automatic Speech Recognition (ASR) is far from reaching the goal of performance close to Human Speech Recognition (HSR). One of the reasons for unsatisfactory performance of the state-of-the-art ASR systems, that are based largely on Hidden Markov Models (HMMs), is the inferior acoustic modeling of low level or phonetic level linguistic information in the speech...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2004